Aligning chemical structure diagrams with local search

نویسندگان

  • Matthias Hilbig
  • Matthias Rarey
چکیده

Chemists working in biomolecular application projects are usually looking at many related molecules (e.g. results of a virtual screening run, lead series development or library design). For a convenient visual analysis of this data it is essential that differences between molecules are easily detectable. This can be quite difficult if structural similarities are not taken into account while creating molecule structure diagrams. We present a method for generating globally aligned structure diagrams for two molecules following IUPAC standards [1]. Using a set of three coordinate transform operations (ring system flipping, chain flipping and substituent exchange) all correct and overlap-free layouts can be enumerated. If the number of possible layouts is too large, a heuristic is used to iterate through a smaller subspace. Subsequently all candidate layouts are scored with several different terms describing the quality of the layout (number of collisions, stretching of chains...) as well as the relationship between molecules (similarity to a template) and the one with the highest score is chosen. Scoring functions and similarity measures are easily interchangeable and the whole process is fast enough for interactive use. The whole alignment process is verified by calculating the RMSD between aligned and nearest template coordinates. For validation, the new method is applied to many clusters of related molecules from the PubChem compound library. In summary, we have developed a novel SDG algorithm which is of great help for the daily tasks of a modeller by drawing small, related molecules consistently.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identification of BKCa channel openers by molecular field alignment and patent data-driven analysis

In this work, we present the first comprehensive molecular field analysis of patent structures on how the chemical structure of drugs impacts the biological binding. This task was formulated as searching for drug structures to reveal shared effects of substitutions across a common scaffold and the chemical features that may be responsible. We used the SureChEMBL patent database, which prov...

متن کامل

Optimal Operation of a DWC by Self-Optimizing Control: Active Vapor Split Approach

Dividing Wall Column(DWC) offers the large potential for operating and capital cost saving in compared with conventional distillation sequence. In the studied DWC in this study, the aid of Vmin diagrams, it is shown that without a suitable value for vapor split fraction bellow the dividing wall in different operating conditions, the energy requirement increases from optimal value and it wil...

متن کامل

Automated extraction of chemical structure information from digital raster images

BACKGROUND To search for chemical structures in research articles, diagrams or text representing molecules need to be translated to a standard chemical file format compatible with cheminformatic search engines. Nevertheless, chemical information contained in research articles is often referenced as analog diagrams of chemical structures embedded in digital raster images. To automate analog-to-d...

متن کامل

A Differential Evolution and Spatial Distribution based Local Search for Training Fuzzy Wavelet Neural Network

Abstract   Many parameter-tuning algorithms have been proposed for training Fuzzy Wavelet Neural Networks (FWNNs). Absence of appropriate structure, convergence to local optima and low speed in learning algorithms are deficiencies of FWNNs in previous studies. In this paper, a Memetic Algorithm (MA) is introduced to train FWNN for addressing aforementioned learning lacks. Differential Evolution...

متن کامل

Drawing Euler Diagrams from Region Connection Calculus Specifications with Local Search

This paper describes a local search based approach and a software tool to approximate the problem of drawing Euler diagrams. Specifications are written using RCC-8-constraints and radius constraints. Euler diagrams are described as set of circles.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 4  شماره 

صفحات  -

تاریخ انتشار 2012